Search CORE

UCL Discovery

Stability of beta-titanium T-loop springs preactivated by gradual curvature

Author: Burstone CJ
Burstone CJ
Burstone CJ
Burstone CJ
Burstone CJ
Caldas SGFR
Caldas SGFR
Caldas SGFR
Chen J
Dalstra M
Earthman JC
Faulkner MG
Hanyuda A
Hazel RJ
Hudgins JJ
Lim Y
Lopez I
Marcotte M
Martins RP
Martins RP
Rose D
Silva Júnior R
William D
Wong EK
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Genome wide prediction of protein function via a generic knowledge discovery approach based on evidence integration

Author: A Drawid
A Lagreid
A Tanay
AC Gavin
AJ Enright
B Schwikowski
CJ Roberts
EM Marcotte
EM Marcotte
GD Bader
HJ Bussemaker
HW Mewes
I Cherel
J Ihmels
Jianghui Xiong
Kunyi Luo
LF Wu
M Ashburner
M Deng
M Deng
M Pellegrini
MB Eisen
MC von
MP Brown
OG Troyanskaya
P Jorgensen
P Uetz
PT Spellman
R Kohavi
R Overbeek
SF Altschul
Shanguang Chen
Simon Rayner
T Ito
TR Hazbun
TR Hughes
U Karaoz
WK Huh
WR Pearson
X Zhou
Y Chen
Y Ho
Yinghui Li
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: The automation of many common molecular biology techniques has resulted in the accumulation of vast quantities of experimental data. One of the major challenges now facing researchers is how to process this data to yield useful information about a biological system (e.g. knowledge of genes and their products, and the biological roles of proteins, their molecular functions, localizations and interaction networks). We present a technique called Global Mapping of Unknown Proteins (GMUP) which uses the Gene Ontology Index to relate diverse sources of experimental data by creation of an abstraction layer of evidence data. This abstraction layer is used as input to a neural network which, once trained, can be used to predict function from the evidence data of unannotated proteins. The method allows us to include almost any experimental data set related to protein function, which incorporates the Gene Ontology, to our evidence data in order to seek relationships between the different sets. RESULTS: We have demonstrated the capabilities of this method in two ways. We first collected various experimental datasets associated with yeast (Saccharomyces cerevisiae) and applied the technique to a set of previously annotated open reading frames (ORFs). These ORFs were divided into training and test sets and were used to examine the accuracy of the predictions made by our method. Then we applied GMUP to previously un-annotated ORFs and made 1980, 836 and 1969 predictions corresponding to the GO Biological Process, Molecular Function and Cellular Component sub-categories respectively. We found that GMUP was particularly successful at predicting ORFs with functions associated with the ribonucleoprotein complex, protein metabolism and transportation. CONCLUSION: This study presents a global and generic gene knowledge discovery approach based on evidence integration of various genome-scale data. It can be used to provide insight as to how certain biological processes are implemented by interaction and coordination of proteins, which may serve as a guide for future analysis. New data can be readily incorporated as it becomes available to provide more reliable predictions or further insights into processes and interactions

An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae

Background: Probabilistic functional gene networks are powerful theoretical frameworks for integrating heterogeneous functional genomics and proteomics data into objective models of cellular systems. Such networks provide syntheses of millions of discrete experimental observations, spanning DNA microarray experiments, physical protein interactions, genetic interactions, and comparative genomics; the resulting networks can then be easily applied to generate testable hypotheses regarding specific gene functions and associations. Methodology/Principal Findings: We report a significantly improved version (v. 2) of a probabilistic functional gene network [1] of the baker's yeast, Saccharomyces cerevisiae. We describe our optimization methods and illustrate their effects in three major areas: the reduction of functional bias in network training reference sets, the application of a probabilistic model for calculating confidences in pair-wise protein physical or genetic interactions, and the introduction of simple thresholds that eliminate many false positive mRNA co-expression relationships. Using the network, we predict and experimentally verify the function of the yeast RNA binding protein Puf6 in 60S ribosomal subunit biogenesis. Conclusions/Significance: YeastNet v. 2, constructed using these optimizations together with additional data, shows significant reduction in bias and improvements in precision and recall, in total covering 102,803 linkages among 5,483 yeast proteins (95% of the validated proteome). YeastNet is available from http://www.yeastnet.org.This work was supported by grants from the N.S.F. (IIS-0325116, EIA-0219061), N.I.H. (GM06779-01,GM076536-01), Welch (F-1515), and a Packard Fellowship (EMM). These agencies were not involved in the design and conduct of the study, in the collection, analysis, and interpretation of the data, or in the preparation, review, or approval of the manuscript.Cellular and Molecular Biolog

Texas ScholarWorks

XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences

Author: A Heger
Aaron M Newman
AM Hauth
AP Garnet
BM Barney
C Hayashi
CJ Cummings
D Gatherer
D Sokol
E Gazit
EM Marcotte
G Benson
G Benson
GM Landau
HD Stahl
J Buard
James B Cooper
JM Berg
JM Hancock
K Inoue
KJ Verstrepen
M Gruber
M Pellegrini
MA Andrade
MK Kalita
ML Tierney
MV Katti
R Dickerson
R Szklarczyk
S Frey
X Qin
Y Goto
Y Goto
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed. Results To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper. Conclusion We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets.</p

Identifying Genetic Dependencies in Cancer by Analyzing siRNA Screens in Tumor Cell Line Panels.

Author: A Chatr-Aryamontri
AL Jackson
CJ Lord
CJ Lord
EG Cerami
GS Cowley
HS Kim
HW Cheung
J Campbell
J Das
J Luo
JH Zhang
KC Helming
M Boutros
MB Yaffe
MS Lawrence
PV Hornbeck
R Brough
R Kelley
R Marcotte
R Moser
SA Forbes
T Davoli
T Hart
TY Hsu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Loss-of-function screening using RNA interference or CRISPR approaches can be used to identify genes that specific tumor cell lines depend upon for survival. By integrating the results from screens in multiple cell lines with molecular profiling data, it is possible to associate the dependence upon specific genes with particular molecular features (e.g., the mutation of a cancer driver gene, or transcriptional or proteomic signature). Here, using a panel of kinome-wide siRNA screens in osteosarcoma cell lines as an example, we describe a computational protocol for analyzing loss-of-function screens to identify genetic dependencies associated with particular molecular features. We describe the steps required to process the siRNA screen data, integrate the results with genotypic information to identify genetic dependencies, and finally the integration of protein-protein interaction data to interpret these dependencies

Institute of Cancer Research Repository

The YARHG Domain: An Extracellular Domain in Search of a Function

Author: AG Uren
Alex Bateman
C Cole
C Yeats
CJ Marcotte
D Kornitzer
EL Sonnhammer
EV Koonin
FE Rey
GM Cohen
H Takeshima
J Zupicich
JC Whisstock
KA Bidle
KA Michie
L Käll
L Tetsch
LS Johnson
M Ehrmann
M Magrane
M Sakoh
MA Dwyer
MJ Gubbels
MJ Wolin
MS Brown
ND Rawlings
Olivier Gires
P Bork
Penny Coggill
Q Cheng
Q Xu
RC Edgar
RD Finn
S Das
SF Altschul
SR Eddy
SS Krishna
V Kapatral
Publication venue: Public Library of Science
Publication date: 17/05/2012
Field of study

We have identified a new bacterial protein domain that we hypothesise binds to peptidoglycan. This domain is called the YARHG domain after the most highly conserved sequence-segment. The domain is found in the extracellular space and is likely to be composed of four alpha-helices. The domain is found associated with protein kinase domains, suggesting it is associated with signalling in some bacteria. The domain is also found associated with three different families of peptidases. The large number of different domains that are found associated with YARHG suggests that it is a useful functional module that nature has recombined multiple times

PESCADOR, a web-based tool to assist text-mining of biointeractions extracted from PubMed queries

Author: A Bairoch
A Barbosa-Silva
A Herbst
A Kashyap
A Renner
Adriano Barbosa-Silva
AJ Perez
AR Aronson
C Blaschke
C Perez-Iratxeta
C Plake
CE Moussa
CJ Gottardi
D Maglott
DA Benson
Elisa R Donnard
EM Marcotte
EW Sayers
Fernanda Stussi
FT Kolligs
H Shatkay
H Xie
I Iliopoulos
IF Tsigelny
J Bjorne
J Hur
J Miguel Ortega
Jean-Fred Fontaine
JF Fontaine
JF Fontaine
JM Olson
L Guo
M Chagoyen
M Miwa
MG Spillantini
Miguel A Andrade-Navarro
R Bunescu
R Hoffmann
R Leaman
S Dihlmann
S Matos
S Mika
SD Hooper
SK Halder
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

BACKGROUND: Biological function is greatly dependent on the interactions of proteins with other proteins and genes. Abstracts from the biomedical literature stored in the NCBI's PubMed database can be used for the derivation of interactions between genes and proteins by identifying the co-occurrences of their terms. Often, the amount of interactions obtained through such an approach is large and may mix processes occurring in different contexts. Current tools do not allow studying these data with a focus on concepts of relevance to a user, for example, interactions related to a disease or to a biological mechanism such as protein aggregation. RESULTS: To help the concept-oriented exploration of such data we developed PESCADOR, a web tool that extracts a network of interactions from a set of PubMed abstracts given by a user, and allows filtering the interaction network according to user-defined concepts. We illustrate its use in exploring protein aggregation in neurodegenerative disease and in the expansion of pathways associated to colon cancer. CONCLUSIONS: PESCADOR is a platform independent web resource available at: http://cbdm.mdc-berlin.de/tools/pescador

Open Repository and Bibliography - Luxembourg

MDC Repository

ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process

Author: CJ Stubben
D Barker
D Barker
D Haft
D Szklarczyk
DA Rodionov
Daniel H Haft
DH Haft
DH Haft
DH Haft
EM Marcotte
F Eckstein
F Enault
GV Glazko
H-Y Ou
J Sun
J Wu
J-P Vert
JAG Ranea
JD Selengut
JD Selengut
JD Selengut
Jeremy D Selengut
L Ferrer
M Csurös
M Huynen
M Pellegrini
MA Huynen
Malay K Basu
MS Gelfand
P Pagel
PM Bowers
PR Kensche
PS Dehal
R Jothi
RL Tatusov
S Briesemeister
S Freilich
SR Eddy
SV Date
SV Date
T Blum
T Gaasterland
T Xu
T Yamada
X Brazzolotto
Y Hong
Y Liu
Y Zhou
Z Jiang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Multiple Signals Converge on a Differentiation MAPK Pathway

Author: A Baudin
A Bender
A Pitoniak
AL Goldstein
AR Borneman
AR Colina
B Guo
B Miotto
Barbara Birkaya
BJ Andrews
BL Schneider
C Charizanis
C Schluter
CJ Bonangelino
CJ Gimeno
CJ Nobile
CJ Roberts
CJ Roberts
CM McDonald
Colin A. Chavel
D Kadosh
D Sinclair
D van Dyk
DA Lashkari
DB Doroquez
DC Hagen
DD Jenness
DJ Krysan
DM Gelperin
DM Rivers
E Boy-Marcotte
E De Nadal
EA Elion
EA Winzeler
EF Wagner
G Mas
G McCaffrey
GD Hurlbut
GM Santangelo
GW Carter
H Liu
HD Madhani
HD Madhani
HD Madhani
Heather M. Dionne
HJ Lo
HO Park
HU Mosch
HU Mosch
HU Mosch
I Dilova
I Laloux
J Chant
J Ogas
J Sambrook
JC Igual
JG Cook
JL DeRisi
JM Bean
JR Broach
JR Ferreira Junior
Jyoti Joshi
K Baetz
K Lemaire
K Nasmyth
K Rothfels
K Tatebayashi
KA Olson
KJ Barwell
KJ Verstrepen
KL Dunn
L Bardwell
L Bardwell
L Breeden
LO Murphy
LS Klig
LS Robertson
LS Robertson
M Qi
M Vidal
M Whiteway
MA Schwartz
MB Eisen
MC Lorenz
MC Yu
MD Rose
MG Lambrechts
Michael Snyder
MJ Carrozza
MJ White
MM Kasten
MS Longtine
MW Pfaffl
N Nakayama
N Vadaie
P Fabrizio
P Poullet
P Sass
Paul J. Cullen
PJ Cullen
PJ Cullen
PJ Cullen
PK Singh
PK Vinod
R Hasan
R Jin
RD Gietz
RD Gietz
RL Roberts
RR Barrales
S Chou
S Giannattasio
S Kuchin
S Kuchin
S Prinz
S Rupp
S Zaman
S Zaman
SE Rundlett
SK Kurdistani
SM O'Rourke
SP Palecek
SR Karunanithi
T Harashima
T Harashima
T Lechner
T Peeters
T Toda
TB Reynolds
TB Reynolds
TG Fazzio
TM Lamb
TS Kim
U Abdullah
V Voynov
VD Longo
W Niu
WP Voth
WS Lo
WS Lo
X Pan
Y Jia
Z Liu
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

An important emerging question in the area of signal transduction is how information from different pathways becomes integrated into a highly coordinated response. In budding yeast, multiple pathways regulate filamentous growth, a complex differentiation response that occurs under specific environmental conditions. To identify new aspects of filamentous growth regulation, we used a novel screening approach (called secretion profiling) that measures release of the extracellular domain of Msb2p, the signaling mucin which functions at the head of the filamentous growth (FG) MAPK pathway. Secretion profiling of complementary genomic collections showed that many of the pathways that regulate filamentous growth (RAS, RIM101, OPI1, and RTG) were also required for FG pathway activation. This regulation sensitized the FG pathway to multiple stimuli and synchronized it to the global signaling network. Several of the regulators were required for MSB2 expression, which identifies the MSB2 promoter as a target “hub” where multiple signals converge. Accessibility to the MSB2 promoter was further regulated by the histone deacetylase (HDAC) Rpd3p(L), which positively regulated FG pathway activity and filamentous growth. Our findings provide the first glimpse of a global regulatory hierarchy among the pathways that control filamentous growth. Systems-level integration of signaling circuitry is likely to coordinate other regulatory networks that control complex behaviors

CiteSeerX